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ABSTRACT 

Aims. The initial cluster mass function (ICMF) in spiral discs is constrained and compared with data for old globular clusters and 
young clusters in starbursts. 

Methods. For a given absolute magnitude, the cluster age distribution depends on the ICMF. Here, the behaviour of the median age- 
magnitude relation is analysed in detail for Schechter ICMFs with various cut-off masses, M c . The calculated relations are compared 
with observations of the brightest clusters in spiral galaxies. Schechter functions are also fitted directly to observed mass functions 
(MFs). 

Results. A single Schechter ICMF with an M c of a few times 10 5 M o can reproduce the observed ages and luminosities of the 
brightest (and 5 th brightest) clusters in the spirals if disruption of optically visible clusters is dominated by relatively slow secular 
evolution. A Schechter function fit to the combined cluster MF for all spirals in the sample yields M c = (2.1 ± 0.4) x 10 5 M o . The 
MFs in cluster-poor and cluster-rich spirals are statistically indistinguishable. An M c = 2. 1 x 10 5 M o Schechter function also fits the 
MF of young clusters in the Large Magellanic Cloud. If the same ICMF applies in the Milky Way, a bound cluster with M > 10 5 M Q 
will form about once every 10 7 years, while an M > 10 6 M o cluster will form only once every 50 Gyr. Luminosity functions (LFs) of 
model cluster populations drawn from an M c = 2.1 x 10 5 M o Schechter ICMF generally agree with LFs observed in spiral galaxies. 
Conclusions. The ICMF in present-day spiral discs can be modelled as a Schechter function with M c ss 2 x 10 5 M o . However, the 
presence of significant numbers of M > 10 6 M o (and even M > 10 7 M o ) clusters in some starburst galaxies makes it unlikely that 
the M c value derived for spirals is universal. In high-pressure environments, such as those created by complex gas kinematics and 
feedback in mergers, M c can shift to higher masses than in quiescent discs. 
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1. Introduction 

Star clusters connect many branches of astrophysics. Classically, 
they have been important test benches for models of stellar evo- 
lution (Maeder & Mermilliod 1981). A large fraction of all star 
formation occurs in clusters (Lada & Lada 2003), and massive, 
long-lived globular clusters (GCs) are potentially useful tracers 
of stellar populations in their parent galaxies (Brodie & Strader 
2006). It is therefore of considerable interest to understand how 
the properties of star clusters are related to those of their parent 
galaxy, and in particular whether the formation (and survival) 
of GC-like objects requires special conditions. Three important 
properties of cluster populations which bear on this issue can, to 
some extent, be studied separately: 1) the initial cluster mass 
function (ICMF), 2) the formation efficiency of (bound) star 
clusters (Larsen & Richtler 2000; Bastian 2008; Parmentier & 
Fritze 2008), and 3) dynamical evolution and cluster disruption, 
addressed both theoretically and observationally in many stud- 
ies (Vesperini & Zepf 2003; Baumgardt & Makino 2003; Lamers 
et al. 2005; McLaughlin & Fall 2008; Gieles & Baumgardt 2008; 
Whitmore et al. 2007). All three may, in principle, depend on 
time and location within a galaxy. This paper is concerned with 
the mass function, primarily its high-mass end and the possible 
existence of an upper cut-off mass. 
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Highly luminous, compact star clusters are observed to form 
in on-going starbursts such as those associated with major gas- 
rich mergers and, in smaller numbers, in many other exter- 
nal galaxies with elevated star formation rates (e.g., Larsen 
2006). However, conversion of observed cluster luminosities to 
masses requires knowledge of the mass-to-light ratios which, in 
turn, are strongly age-dependent. Dynamical mass estimates via 
integrated-light measurements of velocity dispersions have been 
obtained for only a limited number of clusters, but have con- 
firmed that young clusters with masses above 10 5 M Q (and, in 
some cases, 10 6 M o ) are quite common in external star-forming 
galaxies (Ho & Filippenko 1996; Larsen et al. 2001, 2004; 
Larsen & Richtler 2004; McCrady & Graham 2007; Smith & 
Gallagher 200 1 ; de Grij s & Parmentier 2007) . On the other hand, 
most young star clusters known in the Milky Way are relatively 
sparse with masses of 10 2 - 10 4 M Q . This apparent contrast be- 
tween the properties of Milky Way open clusters and the lumi- 
nous star clusters encountered in some external galaxies raises 
important questions about the universality of the cluster forma- 
tion process. Do the most massive clusters form preferentially in 
starburst environments for some physical reason, or do they just 
happen to be statistically more likely to form there because the 
total numbers of stars and clusters formed are very large? 

Most studies of the statistical properties of star cluster pop- 
ulations have concentrated on luminosity functions (LFs), which 
are more readily derived for large samples of clusters than mass 
functions (MFs). There is a roughly linear relation between the 
luminosity of the brightest cluster in a galaxy and the total num- 
ber of clusters, as expected if luminosities are drawn at ran- 
dom from a power-law distribution with slope « -2 (Billett 
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et al. 2002; Larsen 2002; Weidner et al. 2004; Whitmore 2003; 
Whitmore et al. 2007). This implies that if the LF has a physical 
upper limit, no cluster system studied to date is sufficiently rich 
to sample the LF up to that limit, and the (young) cluster popu- 
lations in most galaxies might in fact be drawn from very similar 
underlying LFs. 

Clearly, the MF would be a physically more interesting prop- 
erty to study than the LF. The LF generally depends on both 
the mass- and age distribution of the cluster sample in question 
(e.g. Fall 2006), and since the latter is often unknown, the ob- 
served LFs offer only limited insight into the underlying MFs. 
If the goal is to obtain information about the initial MF, disrup- 
tion effects must also be taken into account. In the small num- 
ber of cases where MFs have been determined, they are gen- 
erally found to be consistent with power-laws dN/dM oc M a 
with a slope a ~ -2 (Bik et al. 2003; de Grijs et al. 2003; 
McCrady & Graham 2007; Zhang & Fall 1999; Dowell et al. 
2008). However, the mass ranges differ from one study to an- 
other (e.g. 2.5 x 10 3 < M/M Q < 5 x 10 4 for the Bik et al. study 
of NGC 5194, and 10 4 < M/M < 10 6 for the Antennae clusters 
studied by Zhang & Fall), so it is not clear that the similarity of 
the slopes derived for different galaxies actually implies a uni- 
versal ICMF 

Even if the MF can typically be fit by a power-law, this might 
only apply over a limited mass range. One potential difficulty 
is the large number of very massive clusters predicted in the 
Milky Way for a uniform power-law MF. For a star formation 
rate in bound star clusters of 5.2 x 1O~ 1O M yr~' pc~ 2 in the 
Solar neighbourhood (Lamers et al. 2005, hereafter LAM05), 
the Milky Way disc would have formed about 100 clusters with 
M > 1O 6 M over a 10 Gyr lifetime, assuming that the cluster for- 
mation rate was constant in the past (probably a conservative as- 
sumption; e.g. Fuchs et al. 2008). These are not observed. Since 
the completeness of existing surveys of young Milky Way clus- 
ters remains extremely poorly quantified it cannot be excluded 
that some remain hidden behind massive extinction, and others 
may have dissolved (Sect. 2.2). Another possibility is that they 
simply never formed, or at least in smaller numbers than pre- 
dicted by extrapolation of a pure power-law MF. 

An upper limit to the ICMF need not be a strict cut-off, but 
could also take the form of an exponential decline above some 
characteristic mass M c , i.e. a Schechter (1976) function: 

dN 

— oc (M/M c f exp (-M/M c ) , (1) 
dM 

According to Bastian (2008), the relation between the luminos- 
ity of the brightest cluster in a galaxy and the star formation rate 
(SFR) can be reproduced if clusters in all galaxies are drawn 
from a Schechter function with M c = (1 - 5) X 10 6 M o . More in- 
direct evidence of a truncation or steepening of the MF in young 
cluster systems comes from the detection of a bend in the LF 
of clusters in NGC 5194 (M51), NGC 6946 and the Antennae 
galaxies (NGC 4038/39), which is consistent with a power-law 
MF truncated at a mass of 5 x 10 5 - 10 6 M o (Zhang & Fall 1999; 
Gieles et al. 2006a). 

The aim of this paper is to examine, for a larger sample 
of "normal" spiral galaxies, the behaviour of the cluster MF 
at its high-mass end and constrain the upper mass limit, M c . 
The most direct way to do this would perhaps be to simply ob- 
tain luminosities and ages for a large sample of clusters in each 
galaxy and converting these to masses, ideally using Hubble 
Space Telescope (HST) imaging to obtain clean cluster samples. 
Although a large amount of data for nearby galaxies have by now 
accumulated in the HST archive, many datasets suffer from two 



basic shortcomings: 1) they lack (deep) imaging in a (roughly) 
f/-band equivalent filter, which is essential for age-dating of the 
clusters, and 2) they cover only a small fraction of the galaxy. 
The latter limitation is particularly problematic when studying 
the most massive clusters, which tend to be rare. The only spiral 
which is sufficiently nearby to take advantage of HST's imaging 
resolution for cluster identification and is fully covered by multi- 
passband HST imaging including the ultraviolet is NGC 5194 
(Scheepmaker et al. 2007; Hwang & Lee 2008), which is ar- 
guably not a "normal" spiral due to its on-going interaction with 
NGC 5195. 

Instead of studying the complete MF in detail for each in- 
dividual galaxy, we begin by taking an approach similar to that 
adopted in several previous studies of the LF: from the corre- 
lation between the magnitude of the brightest cluster and some 
estimate of the total cluster population, one can test whether the 
data are consistent with random sampling from a single, univer- 
sal LF. Here, we go one step further and also attempt to draw 
conclusions about the MF from observations of the brightest 
(and 5 th brightest) clusters. As will become clear below, the cru- 
cial extra information that makes this possible are the cluster 
ages, but because these are needed only for the brightest clus- 
ters, ground-based data are generally adequate. 

The general outline of the paper is as follows: In Sect. 2 we 
determine the relation between the median age and luminosity 
of the brightest and 5 th brightest clusters (or, in fact, any sam- 
ple with magnitudes in a fixed, narrow range) for a given sample 
size, cut-off mass M c , and age distribution. This statistical age- 
luminosity relation should not be confused with the general fad- 
ing of an individual star cluster that is the result of stellar evolu- 
tion and cluster dissolution. The (log(age), My) predictions are 
compared (in Sect. 2.5) with previously published ground-based 
data for clusters in a sample of nearby spiral galaxies in order to 
put constraints on M c . In Sect. 3, a direct Schechter function fit to 
observed MFs is performed and yields M c = (2.1 ±0.4)x 10 5 M o . 
In Sect. 4 the properties of the LF for a cluster sample drawn 
from an M c — 2. 1 x 10 5 M o Schechter MF are compared with ob- 
served LFs. In Sect. 5 the implications of an M c = 2.1 x 10 5 M o 
Schechter MF are discussed in the context of other cluster sys- 
tems, and it is concluded that while such a MF is consistent with 
existing data for Milky Way and Large Magellanic Cloud clus- 
ters, it is unlikely to be universal. Finally, in Sect. 6 the results 
are discussed in the context of host galaxy properties and current 
views on star/cluster formation. 

2. Constraining the upper limit of the MF via ages 
and luminosities of the brightest clusters 

2.1. Uniform age distribution and a truncated power-law 
mass function 

As a first illustration of the statistical behaviour of the upper 
mass- and luminosity boundaries of a cluster population, clusters 
with masses between 10 4 M and 10 6 M G were drawn at ran- 
dom from a power-law mass distribution, dN/dM oc M~ 2 . Each 
cluster was assigned a random age between 5 Myr and 1 Gyr. 
Luminosities corresponding to the age and mass of each cluster 
were calculated using solar-metallicity simple stellar population 
(SSP) models from Bruzual & Chariot (2003, hereafter BC03). 
These models will be used throughout this paper and assume 
a Salpeter-like (dN/dM oc M~ 235 ) stellar initial mass function 
(IMF) for 0.1 <M/M Q < 100, which probably overestimates the 
number of low-mass stars. For more realistic Kroupa (2002) or 
Chabrier (2003) IMFs, the mass-to-light ratios (and correspond- 
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Fig. 1. Simulated absolute magnitude Mv versus log(age) plots for clus- 
ter systems of increasing richness. A power-law mass function truncated 
at 10 6 Mr, is assumed. 



ing masses quoted in this paper) would be smaller by a factor of 
about 1.5. 

Figure 1 shows the resulting (log(age), Mv) plots for sam- 
ples of N - 10 2 , 10 3 , 10 4 and 10 5 clusters (note the use of cal- 
ligraphic letters Mv in this paper to distinguish absolute mag- 
nitudes from masses). The dashed line in each panel is the fad- 
ing line of a 1O 6 M cluster, the maximum possible brightness 
at a given age for the truncated power-law MF assumed here. 
For a Schechter function, there would be no strict upper limit. 
While the effects of cluster disruption are ignored in this plot, 
it serves to demonstrate a couple of important points. First, al- 
though the mass function is sampled up to near the cut-off mass 
in all four cases, this is not the case for the luminosity function. 
The brightest Mv magnitude possible for a 10 6 M cluster is 
M™ 1 = -14.5, but only the two most populous samples get 
close to this value. In other words, tlie maximum cluster mass in 
each sample is close to the physical limit set by tlie MF, but the 
maximum brightness is determined by random sampling. This 
holds true over several orders of magnitude in N. Second, the 
brightest cluster tends to be younger in the more populous clus- 
ter systems. For a poor cluster system, rapid fading reduces the 
probability that a very massive cluster happens to be caught in 
the short time interval after its birth when it is most luminous. 
As the number of clusters is increased, it becomes increasingly 
likely to encounter a young cluster with a mass near the physical 
upper limit and a luminosity that cannot be matched by an older 
object. Thus, the mean or median age of the brightest cluster 
shifts towards younger values. 

The specific age-luminosity relation for the brightest cluster 
will depend on the actual upper mass limit, whether this is a 
strict cut-off as in Fig. 1 or the M c of a Schechter function. It 
therefore potentially offers a diagnostic of the upper limit. This 
will be discussed in further detail below (Sect. 2.3). 



2.2. Cluster disruption 

The simple experiment described in the previous paragraph suf- 
fers from a number of shortcomings. In particular, the relation 
between the age and magnitude of the brightest cluster will de- 
pend on the age distribution of the cluster population. This will, 
in general, be affected by variations in the star (cluster) forma- 
tion rate as well as by disruption. 

A typical open cluster in the Milky Way will gradually evap- 
orate and dissolve completely within a few times 10 8 years 
(Wielen 1971), due to the combined effects of stellar evolution, 
internal two-body relaxation and external perturbations (Spitzer 
1958, 1987; Terlevich 1987; Lamers & Gieles 2008). This "secu- 
lar evolution" (Parmentier & Fritze 2008) causes a cluster to lose 
mass at a (roughly) constant rate, so that lifetimes are in general 
expected to scale with cluster mass. The exact relation between 
lifetime and mass will also depend on the structural parameters 
of the clusters, the relation between half-mass radius and mass, 
and the environment in which they find themselves. According 
to LAM05, a useful fitting formula is 



(Mi 

fdis " t4 \i¥M~ e 



(2) 



where the parameter y has a value of about 0.62 and fdis is the dis- 
ruption time scale of a cluster with initial mass M, . For the Milky 
Way, LAM05 find a disruption time scale for a 10 4 M Q cluster of 
?4 = 1.3 x 10 9 years, so that extrapolation of Eq. (2) leads to 
a lifetime of ~ 5 Gyr for a 10 5 M o cluster and ~ 23 Gyr for a 
1O 6 M cluster. Thus, the most massive clusters are expected to 
survive for a significant fraction of the age of the Universe in the 
Galactic disc. It should be noted that a y-value closer to unity is 
required to reproduce the MF of old GCs as a result of dynam- 
ical evolution from an initial power-law or Schechter function 
(e.g. Jordan et al. 2007). It therefore remains uncertain whether 
a universal y value applies everywhere, but for our purpose the 
exact numerical value is not important. 

Many clusters are likely born out of virial equilibrium, or 
are forced into a state of non-equilibrium once residual gas from 
the star formation process is expelled. During the ensuing vio- 
lent relaxation, clusters may disrupt completely or partially and 
undergo a phase of "infant mortality" or "infant weight loss" 
which lasts up to a few tens of Myr (Kroupa & Boily 2002; Lada 
& Lada 2003; Goodwin & Bastian 2006; Whitmore et al. 2007). 
Unlike the secular dissolution discussed above, this process is 
believed to affect clusters of all masses more or less equally. 
In the Milky Way, the Magellanic Clouds, and possibly several 
other galaxies, mass-independent disruption seems to be rela- 
tively unimportant for non-embedded clusters older than ~ 10 
Myr (Lamers et al. 2005; Gieles et al. 2007; de Grijs & Goodwin 
2008; Gieles & Bastian 2008, but see Chandar et al. 2006 for an 
alternative view regarding the SMC). For the Antennae galaxies, 
mass-independent disruption instead seems to lead to the loss of 
80-90% of the cluster population per logarithmic age bin for the 
first ~ 10 8 - 10 9 years (Fall et al. 2005; Whitmore et al. 2007). 

For secular dissolution the conclusions drawn from Fig. 1 
would not change dramatically unless the disruption time scale 
is much shorter than in the Milky Way. Even for a disruption 
time as short as t\ - 200 Myr, as derived for the inner disc of 
NGC 5194 (Gieles et al. 2005), a 10 5 M Q cluster would still sur- 
vive for about 800 Myr (although its mass would be steadily de- 
creasing over this time span). Mass-independent disruption, on 
the other hand, can have a strong impact on the age distribution 
of any subsample. In the following, both mass-dependent and 
mass-independent disruption will be considered. 
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Fig. 2. Absolute My magnitude versus age for the brightest cluster 
when sampling N clusters at random from a Schechter mass function 
with M c = 10 6 M Q . For each N, a number of realisations are shown. 
The curve is the Mv magnitude vs. age for a 10 6 M o cluster. No disrup- 
tion is included. 
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Fig. 3. Age distribution of the brightest cluster for M c = 10 6 M o and 
N = 10 4 . The histog ram is for the N = 10 4 Monte-Carlo simulation in 
Fig. 2 while the solid (blue) curve shows the exact relation according to 
Eq. (8) for M v = - 10.3 mag. 



2.3. Including cluster disruption and Schechter mass 
functions: the set-up 

Fig. 2 shows the median Mv vs. log(age) for the brightest clus- 
ter in random samples of N - 10 3 , 10 4 , 10 5 and 10 6 clusters 
with M > 5000M Q drawn from a Schechter mass function with 
M c = 1O 6 M and a = -2. In this case, no cluster disruption was 
included and the age distribution was assumed to be uniform 
over 10 10 years. As expected, Mv generally becomes brighter, 
and log(age) younger, when sampling more clusters. It is per- 
haps not quite intuitive that the Mv and age of the brightest clus- 
ter should be uncorrected for fixed M c and N. This is due to the 
fact that clusters are selected based on their luminosities, so that 
the scatter in log(age) in Fig. 2 simply represents the range of 
possible ages for a given Mv- For small N the age distribution is 
very extended and the brightest cluster can be either young (and 
relatively low mass) or old and relatively massive. For large N 
the brightest cluster will generally be fairly young and massive. 
Also included in the figure is the BC03 model prediction for the 
Mv magnitude of a 1O 6 M cluster as a function of age. Since the 
Schechter function does not cut off abruptly at M c , the brightest 
cluster will be more massive than M c in some fraction of the re- 
alisations. This occurs more frequently in rich cluster systems 
where the high-mass tail of the Schechter function is more fully 
sampled over the entire age range, but can happen occasionally 
even in a cluster-poor galaxy. For the Monte-Carlo simulations 
shown in Fig. 2, the brightest cluster is more massive than M c in 
39% of the realisations for N = 10 6 , and in 7% of the cases for 
N = 10 3 . This again illustrates that while the luminosity of the 
brightest cluster is strongly dependent on N, this does not trans- 
late trivially to a similar scaling of the maximum cluster mass 
with N. 

To quantify the effects of cluster disruption, three cases are 
considered: 

- Case A: No disruption 

- Case B: No infant mortality, and secular evolution with = 
10 8 years and exponent y = 0.62. The disruption time scale 
assumed here is deliberately chosen to be much shorter (a 



factor of 10) than derived for the Solar neighbourhood, so 
as to illustrate the effect of secular disruption in an extreme 
case. In this scenario, the mass M(t) at time t of a cluster 
with initial mass M, can be approximated by 



M(t) =* Mi 



(^ev(O) 



r _ 



yt /lO 4 M Ny 



Mi 



1/7 



(3) 



where yu ev (?) indicates the fraction of mass left in the cluster 
after accounting for mass loss due to stellar evolution. This 
quantity is tabulated in the BC03 models. Eq. (3) can be in- 
verted to 



Mi 



10 4 M o 



M(t) 
10 4 M C 



yt 



/"ev(0 



(4) 



Case C: Infant mortality leading to the loss of IMR=90% of 
clusters per log(age) interval until Tj m . max = 10 8 years and no 
secular evolution. For a mass-limited sample, the surviving 
fraction of clusters / surv will be 



1 for t < to 

(T/T ) 1Og(1 - IMR) for T < T < Tj, 

(^*im,max/^"o) ^ ^ T > Ti m inax 



(5) 



where to marks the onset of infant mortality. We assume to = 
5 x 10 6 yr so / surv = [t/(5 x 10 6 years )]^ for the first 10 8 
years and / surv = 0.05 afterwards (Whitmore et al. 2007). 

Monte-Carlo simulations similar to those shown in Fig. 2 
could, in principle, be used to determine how Mv and log(age) 
for the brightest cluster depend on M c and for each disruption 
scenario. However, this can also be established more generally 
by considering the distribution of cluster ages corresponding to 
a particular absolute magnitude, Mv, or luminosity Ly. For a 
sample of clusters with fixed age t, the luminosity function will 
be 



dN 
dLv 



dN dM 
dM dL v 



(6) 
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where here M(Ly, r) is the mass of a cluster with age t and ( V- 
band) luminosity Ly. The first factor expands to 



dN _ dN dMi 
d~M ~ d~Mi~dM 



(7) 



where dN/dMj is the ICMF and dMj/dM follows by straight 
forward differentiation of Eq. (4). The second factor in Eq. (6) is 
the mass-to-light ratio, T v (t) = M(L v ,t)/L v . Representing the 
ICMF by iff(Mi), we can then write 



-p?- = ip(Mi) i^A T v (t) CFR(r) / surv (r) 
drdLy \dM j 



(8) 



where CFR is the cluster formation rate (of bound clusters) and 
/survCr) is the survival fraction after applying mass-independent 
disruption. It is assumed here that T only depends on t so that it 
can be computed from standard SSP models, but a mass depen- 
dency (e.g., Kruijssen 2008) could in principle also be included. 
In general, the ICMF should be normalised to unit mass per unit 
time over some range of initial cluster masses M\ < M, < M up . 

Fig. 3 compares the age distribution for the Monte-Carlo 
simulation for M c = 1O 6 M and N - IQ 4 (Fig. 2) with the exact 
expression (8), scaled to match the Monte-Carlo data. We have 
used My = -10.3, which is the median absolute magnitude of 
the brightest cluster for this M c and N. The Monte-Carlo sim- 
ulations and exact expression show excellent agreement within 
the Poisson errors. From Eq. (8) it is straight forward to calcu- 
late statistics such as the mean or median age of clusters with 
a given luminosity, as well as the standard deviation, mean ab- 
solute deviation, etc. These statistics will apply to any cluster 
sample selected by luminosity, i.e. not only the brightest clus- 
ter, but also the 5 th brightest, or clusters with a fixed magnitude. 
In addition, the integral over all r gives the luminosity function, 
dN/dLy 

2.4. Predicted median (log(age), My) relations for the 
different disruption scenarios 

The median (log(age), Mv) relations for the three disruption 
scenarios are shown in Fig. 4. The relations have been cal- 
culated for Schechter functions with or = -2 and M c = 
10 4 M G , 10 5 M G , 10 6 M o and 10 7 M o (solid lines) as well as for 
M c = 3 x 10 5 M o and M c = 10 15 M G (dotted lines). The latter 
is essentially an untruncated power-law with a = -2. An age 
range of 5 Myr - 10 Gyr and a constant CFR are assumed in all 
cases. In addition to the constant-M c curves, lines for constant 
number of clusters («e) brighter than Mv = -6 are drawn with 
(red) dashed lines. Note the use of lower-case n to distinguish 
this from the number of clusters above a particular mass limit, 
N, which is more difficult to relate directly to observations. The 
constant-«6 curves were determined by first finding a normalisa- 
tion constant no for the luminosity functions so that 



r Mv= - b dN 

«6 = «0 ~T7T~ d Mv 

Jai v =-oo dM v 



and then solving 

n Mv<br 



= n Q 

J j\.\y--Ct 



dN 



dM v 



)Mv=-o° dMv 
for the magnitude of the brightest cluster, My and 

«M v <5th = «o I -mr aM\ 

JMv= 



dM v 



(9) 



(10) 



(11) 



for Aty* 6 , the magnitude of the 5 th brightest cluster. The con- 
stants on the left-hand sides of Eq. (10) and Eq. (11) were found 
by comparison with Monte-Carlo experiments where «6 clusters 
were sampled at random from the LFs. Based on 1000 such sam- 
ples for each of a variety of disruption scenarios, M c and n 6 
values, «Ai v <br = 0.75 and «Ai v <5th = 5 were found to be suit- 
able values. With these constants, Eq. (10) and (11) reproduced 
the median magnitudes of the brightest and 5 th brightest clus- 
ters within better than 0.1 mag. The Monte-Carlo experiments 
were also used to estimate the vertical error bars in Fig. 4, which 
indicate the median absolute deviation of the brightest and 5 th 
brightest cluster magnitudes for fixed «6 and M c . Note that the 
scatter in Mv is significantly smaller for the 5 th brightest cluster. 
The horizontal error bars, indicating the median absolute devi- 
ation of t, were calculated directly by integrating Eq. (8) over 
t to find the positive and negative age intervals with respect to 
the median that include half the clusters for a given Mv- The 
sizes of the horizontal error bars are given simply by the shape 
of the age distribution, and are therefore identical for the bright- 
est and 5 th brightest clusters. Because the constant-M c curves are 
fixed by the age distribution of clusters for a given magnitude, 
the model relations in the upper and lower panels differ only in 
the constant-^ (dashed) curves. These are shifted to fainter mag- 
nitudes in the lower panels, in accordance with the median mag- 
nitude difference between the brightest and 5 th brightest cluster 
for each (M c ,ne) combination. 

In the special case where there is no disruption and the 
MF is a uniform power-law, the age distribution is luminosity- 
independent. In this case, the median age of the brightest cluster 
(or any other luminosity-selected sample) will also be indepen- 
dent of luminosity and n^, as indicated by the vertical dashed 
line for the Case A scenario in Fig. 4. This is true regardlessly 
of the actual value of the MF slope a, although the specific me- 
dian age will depend on a. For Schechter MFs with a finite M c , 
the median age will instead decrease as n 6 and the luminosity of 
the brightest cluster increase, as discussed above and illustrated 
by the solid lines in the Case A panels. The introduction of sec- 
ular dissolution does not strongly affect the age distribution of 
the brightest clusters in cluster-rich systems, where these clus- 
ters will tend to be massive and long-lived. Hence, the Case A 
and Case B scenarios resemble each other for large «e- However, 
for very cluster-poor systems secular dissolution starts to have 
a noticeable effect on the age distribution of even the brightest 
clusters, and the median age shifts back towards younger ages as 
«6 becomes small. For the Case C scenario the age distribution 
is, per definition, deficient in older clusters of all masses. In this 
case the median age shifts to younger values for all n^, albeit 
with a tail towards older ages. 

For Case A and Case B, Fig. 4 shows a clear separation be- 
tween the curves for different M c , suggesting that this type of di- 
agram may indeed be used as a diagnostic tool to constrain M c . 
However, it is equally clear that much of this diagnostic power 
is lost for cluster samples that are dominated by a high degree 
of mass independent disruption, as in Case C. Due to the large 
scatter, one should not assign much significance to the location 
of any particular galaxy in Fig. 4, but for large galaxy samples 
it is still possible to draw conclusions about M c in a statistical 
sense. The inconvenience of having to include a large number 
of galaxies is counterbalanced by the fact that only the brightest 
(or few brightest) clusters have to be identified and age-dated. 
Furthermore, it is not a strict requirement that the entire galaxy 
is covered; any sub-sample of clusters can equally well be com- 
pared with the grids, as long as the sub-sample can be assumed 
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Case A: No disruption Case B: IMR=0, t 4 =1 8 yr Case C: IMR=90%, t 4 =1 15 yr 




7.0 7.5 8.0 

Log (age / yr) 



7.0 7.5 8.0 

Log (age / yr) 



7.0 7.5 8.0 
Log (age / yr) 



Fig. 4. The median absolute magnitude of the brightest (top) and 5 lh brightest (bottom) cluster versus median age. Results are shown for Schechter 
mass functions with M c = 10 4 , 10 5 , 10 6 and 10 7 M Q and a = -2 (solid curves), as well as for M c = 3x 10 5 A/ o and M c = 10 15 M o (dotted curves). 
The (red) dashed lines are for constant total number of clusters (rif, = 10 1 , 10 2 , . . . , 10 5 ) brighter than Mv = -6. The error bars mark the median 
absolute deviation of log(age) and Mv for fixed M c and n 6 . The triangles mark clusters in irregular galaxies while circles are for spirals (see 
Table 1). 



to be representative of the global cluster population in a galaxy. 
Of course, this assumption may not always be justified. 

It is important to stress that the grids in Fig. 4 are only 
suitable for comparison with galaxies for which it is plausible 
that the cluster formation rate has been approximately constant 
over the past few Gyrs. More specifically, the cluster popula- 
tion should not be dominated by a single strong burst so galaxies 
which are currently undergoing strong bursts of star formation, 
or have done so in the past (e.g. merger remnants) would require 
a custom-made grid tailored for their specific star formation his- 
tories. However, variations in the cluster formation rate of a fac- 
tor of a few are still acceptable. Among normal disc galaxies, 
which are our primary interest here, star- and cluster formation 
histories are more likely to have been roughly uniform, at least 
on average. For the Milky Way, several studies suggest that vari- 
ations of a factor of 2-3 in the star formation rate on time scales 
of 10 8 - 10 9 years (Rocha-Pinto et al. 2000; Hernandez et al. 
2000) are applicable. 

2.5. Comparison with observations 

Table 1 lists data for the brightest and 5 th brightest cluster in a 
sample of galaxies. These are mostly spirals, taken from our pre- 
vious work, with the addition of a few irregular galaxies. For the 



galaxies with data from Larsen (1999, 2002), which are ground- 
based with more or less homogeneous detection limits, the last 
column lists the total number of clusters detected in each galaxy. 
Formally, the detection limit is at Mv = -8.5 (My = -9.5 for 
clusters bluer than U—B = -0.4), but significant incompleteness 
likely sets in well above these limits. In selecting the cluster sam- 
ples in the spirals, objects with Ha emission were excluded so 
that the samples mostly consist of clusters older than about 10 
Myr. 

Overall properties of the cluster systems and the host galax- 
ies have been discussed extensively by Larsen & Richtler (2000). 
The spiral galaxies span more than an order of magnitude in ab- 
solute and area-normalised star formation rate, and are mostly 
type Sbc or later. With the exception of NGC 5194, none is in- 
volved in any strong on-going interaction. The star formation 
rates show a strong correlation with the richness of the cluster 
systems (Larsen 2002), so that the last column in Table 1 may 
also be taken as an indication of the relative star formation rates 
in the galaxies. The irregular galaxies constitute a more hetero- 
geneous sample, with at least some showing clear evidence of 
bursty star formation histories. 

A requirement in collecting data for Table 1 was that age 
estimates and V-magnitudes were available for each individual 
cluster. In a few cases where less than five clusters were de- 
tected, the columns for the 5 th brightest cluster are left empty. 
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Table 1. Data for the brightest and 5 th brightest clusters in a sample of 
galaxies. 



Galaxy 


Brij 


;htest 


5 th bri 


ghtest 


Ref " 


N* 




M v 


log(age) 


M v 


log(age) 






Irregulars: 














NGC 1569 


-13.82 


7.10 


-10.86 


7.20 


1 


- 


NGC 1705 


-13.71 


7.00 


-7.71 


6.85 


2 




NGC 4214 


-12.04 


8.93 


-9.40 


6.78 


2 




NGC 4449 


-10.51 


6.2 


-9.66 


6.2 


3 




NGC 1156 


-11.10 


6.97 


-10.12 


7.47 


4 


22 


Spirals: 














NGC 45 


-9.94 


7.10 


- 


- 


4 


3 


NGC 247 


-10.24 


7.35 


- 


- 


4 


3 


NGC 300 


-9.85 


7.24 


- 


- 


4 


3 


NGC 628 


-11.33 


7.25 


-10.55 


7.14 


4 


38 


NGC 1313 


-12.08 


7.08 


-11.16 


7.18 


4 


45 


NGC 2403 


-9.87 


7.66 


-9.44 


7.67 


4 


14 


NGC 2835 


-10.87 


7.09 


-9.88 


7.65 


4 


9 


NGC 2997 


-12.92 


7.34 


-12.23 


7.19 


4 


34 


NGC 3184 


-10.61 


7.56 


-9.13 


8.32 


4 


13 


NGC 3521 


-11.49 


6.77 


-11.10 


8.10 


5 


13 


NGC 3621 


-11.88 


7.31 


-11.06 


7.36 


4 


45 


NGC 4258 


-12.68 


6.23 


-11.17 


6.40 


5 


44 


NGC 4395 


-9.06 


8.17 


- 


- 


4 


2 


NGC 5055 


-11.38 


7.01 


-10.75 


7.14 


5 


24 


NGC 5194 


-12.82 


6.23 


-11.60 


6.65 


5 


69 


NGC 5204 


-9.55 


8.08 


-8.68 


8.68 


4 


7 


NGC 5236 


-11.70 


7.44 


-11.31 


7.08 


4 


149 


NGC 5585 


-10.82 


7.62 


-9.33 


7.84 


4 




NGC 6744 


-10.95 


7.14 


-10.09 


7.28 


4 


18 


NGC 6946 


-13.00 


7.22 


-11.71 


7.01 


4 


107 


NGC 7424 


-11.43 


7.40 


-10.76 


7.13 


4 


10 


NGC 7793 


-10.42 


7.69 


-9.85 


7.47 


4 


20 



References: 1: Anders et al. (2004), 2: Billett et al. (2002), 3: Gelatt 
et al. (2001), 4: Larsen (1999), 5: Larsen (2002) 
* Number of clusters detected 



Ages were obtained from UBV photometry, employing the S- 
sequence method (Girardi et al. 1995). The S-sequence is es- 
sentially a curve drawn through the mean colours of Large and 
Small Magellanic Cloud clusters in the (U - B, B-V) plane. 
Age increases along the curve from blue to red colours, and 
clusters which do not fall exactly on the curve are projected 
onto it along straight lines that take into account the effects of 
stochastic colour fluctuations as well as interstellar extinction. 
Alternatively, ages could have been estimated by comparing the 
observed colours with SSP models. This was also tried (using the 
BC03 models), and differences with respect to the S-sequence 
were found to be minor. The latter was preferred here due to its 
empirical foundation, although the S-sequence is strictly valid 
only for LMC-like metallicity. For further details concerning the 
properties of individual galaxies and the data reduction and anal- 
ysis, please consult the references given in Table 1. 

The spirals are overplotted with filled circles in Fig. 4 while 
the irregulars are shown with triangles. Comparing with the Case 
A and B panels, most data points fall between the M c = 10 5 M o 
and M c = 10 6 M G curves. The two most outlying data points are 
for NGC 1569 and NGC 1705, both of which have experienced 
recent bursts of star formation (Annibali et al. 2003; Angeretti 
et al. 2005) and are dominated by one or two very bright, young 
clusters. These two systems therefore provide a good illustra- 
tion of the limitations of this method. For the rest of the data 
points, a Schechter function with an M c of a few times 10 5 M o 
fits the mean trend quite well for both the brightest and 5 th - 



brightest cluster. An exception to this statement occurs at low «6 
in the Case B scenario, which may well be an indication that the 
disruption time scale assumed here is too short to match these 
galaxies. For the Case C scenario the picture is less clear. No 
single M c value can match the data for the disruption parame- 
ters assumed here. A lower IMR would alleviate this problem to 
some extent by making the (log(age), Mv) relations more simi- 
lar to those for the no-disruption scenario. For example, an IMR 
of 50% per dex over 10 9 years would provide an acceptable fit. 
Alternatively, the trend towards higher median ages in low-«6 
galaxies might be explained by invoking a lower infant mortal- 
ity rate in these galaxies specifically, causing a shallower decline 
in / S urv with age. 

Of the galaxies in Table 1, cluster disruption has only been 
studied in a few. In NGC 5194, the age-mass distribution of 
clusters older than about 10 Myr can be well fitted by a secu- 
lar evolution model with a rather short disruption time scale of 
U * 200 Myr (Bastian et al. 2005; Gieles et al. 2005). Five ad- 
ditional galaxies (NGC 45, NGC 1313, NGC 4395 NGC 5236, 
NGC 7793) have been studied by Mora et al. (2008), but their 
data did not allow a clear distinction between secular evolu- 
tion and infant mortality disruption scenarios. Without detailed 
knowledge about the cluster disruption laws in the galaxies, 
the interpretation of Fig. 4 in terms of M c remains somewhat 
ambiguous. However, it does seem clear that if the MF is a 
Schechter function, then M c cannot be much lower than a few 
times 10 5 M o in any of these galaxies, regardless of the cluster 
disruption law. More generally, any significant steepening (or 
truncation) of the MF below a few times 10 5 M G , compared to a 
power-law with a slope around -2, is ruled out. 

2.6. Radial trends 

Many properties of galaxies correlate with galactocentric dis- 
tance: gas density, star formation rate, relative velocities of spiral 
arms and stellar orbits, metallicity, etc. It is not unreasonable to 
suspect that this might also lead to radial variations in the clus- 
ter MF. So far, radial trends in the properties of (young) cluster 
populations remain poorly studied. In NGC 5194, a radially de- 
pendent bend in the LF suggests that the upper limit of the MF 
might decrease outwards (Haas et al. 2008). 

Fig. 5 shows the same Case A model grids as the left-hand 
column in Fig. 4, together with the age and magnitude of the 
brightest and 5 th brightest cluster in three radial subsamples for 
NGC 5236 and NGC 6946. These two galaxies have the largest 
number of detected clusters among those in Table 1 . The radial 
bins were defined such that each bin contains an equal number of 
clusters (49 in NGC 5236, 35 in NGC 6946). The corresponding 
radial ranges are indicated in the figure legend. As in Fig. 4, the 
scatter of the data points is well within the error bars, i.e. the data 
are consistent with a radially independent M c in both NGC 5236 
and NGC 6946. 



3. Comparing MFs directly 

The ground-based data used in Sect. 2 are, unfortunately, not 
ideal for direct studies of the MF. Visual inspection played a 
central role in the identification of cluster candidates, so com- 
pleteness effects are hard to quantify. Nevertheless, it is tempting 
to attempt a comparison of MFs derived from these data with a 
Schechter function. 

First, a purely differential comparison was carried out by 
dividing the spiral galaxies in Table 1 into a cluster-rich 
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Fig. 5. My versus log age for the brightest and 5 th brightest clusters in 
radial sub-samples in NGC 5236 and NGC 6946. In each galaxy, each 
radial bin contains an equal number of clusters. The model lines are the 
same as for the Case A scenario in Fig. 4. 



and a cluster-poor sample. The cluster-rich sample consists of 
those galaxies with more than 100 clusters each (NGC 5236, 
NGC 6946) and the cluster-poor sample of galaxies with less 
than 40 clusters each. NGC 2997 formally falls in the cluster- 
poor category, but has been excluded because it is more distant 
than most of the other galaxies in the sample. Its high specific U - 
band cluster luminosity, T L (U) = 1.45 (Larsen & Richtler 2000) 
suggests that it would probably have fallen in the cluster-rich 
sample if located at the distance of NGC 5236 or NGC 6946. 

Having an estimate of the age and My for each cluster, 
masses were derived using mass-to-light ratios from BC03. In 
principle, both the S-sequence method and a comparison with 
SSP models allow the extinction to be estimated for each in- 
dividual cluster, by determining the shift along the extinction 
vector that is required to obtain the best match between ob- 
served and model/S-sequence colours. Both methods were tried, 
and yielded small mean Ay values for both the cluster-poor and 
cluster-rich samples ((Ay) m -0.1 — 1-0.2 mag). However, the 
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Fig. 6. Combined cluster mass functions for the two galaxies with more 
than 100 clusters each (NGC 5236, NGC 6946) and those with less than 
40 clusters each. Clusters with ages < 2 x 10 s years are included. Also 
shown is the mass function for young clusters in the Antennae (Zhang 
& Fall 1999) and a Schechter function with M c = 2.1 x 10 5 M G and 
a = -2. 



scatter was large (0.4—0.6 mag for the S-sequence method and 
0.7-1.0 mag when using the SSP models) with a significant tail 
towards negative extinctions. This suggests that a correction for 
extinction for each individual cluster would introduce signifi- 
cant spurious scatter, possibly because differences between ob- 
served and model colours can have other causes (e.g., stochastic 
colour fluctuations due to the finite number of stars in a cluster, 
or simply photometric errors). A choice was therefore made to 
omit extinction corrections for individual clusters. However, the 
following analysis was, in fact, also carried out (though not de- 
scribed in detail here) for extinction-corrected cluster samples, 
with no change to the conclusions. 

The MFs for clusters younger than 200 Myr in the two spi- 
ral samples are compared in Fig. 6. This age range was chosen 
such that secular evolution can be expected to play a relatively 
minor role in most galaxies. At 200 Myr, the detection limit 
corresponds to a limiting mass of about 7 x 10 4 M o , but as in 
any cluster sample limited by a magnitude cut, this mass limit is 
strongly age-dependent and younger clusters with masses down 
to ~ 1O 4 M are present in the samples. Also shown is the MF for 
young clusters in the Antennae in the age range 25 Myr - 160 
Myr (Zhang & Fall 1999), derived assuming a distance of 19.2 
Mpc. Recently, Saviane et al. (2008) claimed a smaller distance 
of 13.3 Mpc based on the red giant branch tip, but Schweizer 
et al. (2008) find a distance of 22 Mpc based on observations of 
the Type-la supernova 2007sr and point out a number of prob- 
lems with a smaller distance. Hence, we have kept the original 
mass scale of Fall & Zhang. The apparent peaks in the MFs 
for the two spiral samples are undoubtedly due to incomplete- 
ness. The relatively higher fraction of clusters with masses be- 
low 10 5 M o in the cluster-poor spirals is most likely due the eas- 
ier identification of low-mass clusters in these galaxies, which 
tend to have lower star formation rates, lower surface brightness, 
and less crowding. Above 1O 5 M , however, the two spiral MFs 
appear remarkably similar to each other, and a Kolmogorov- 
Smirnov test confirms this impression by returning a P-value of 
0.75 when comparing the two samples. In other words, the MFs 
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in the cluster-rich and cluster-poor spirals are statistically in- 
distinguishable. Also worth noting is that the spiral MFs extend 
to nearly as massive clusters as those in the Antennae, although 
with a steeper slope. 

A maximum-likelihood fit of a Schechter function with fixed 
faint-end slope a — -2 to the combined spiral data returns a 
best-fit M c = (2.1 ± 0.4) x 10 5 M o , also indicated in Fig. 6 (ar- 
bitrarily scaled). Comparing this fit with the data, a K-S test 
returns a P-value of 0.22 for the combined spiral sample and 
P — 0.17 and P = 0.86 for the cluster-rich and cluster-poor sam- 
ples separately. In all cases the mass distributions are consistent 
with being drawn from a single Schechter function. Moreover, 
the fitted M c value appears consistent when comparing with the 
Case A and B scenarios in Fig. 4 and Fig. 5. The fit is not 
unique: for example, a Schechter function with a steeper faint- 
end slope (a = -2.2) and a higher M c — 2.5 x 1O 5 M provides an 
equally good fit. However, a Schechter function with a = — 2 and 
M c = 1O 6 M is ruled out at high confidence level; in this case 
a K-S test returns a P value of only 2 x 10~ 6 and even higher 
M c are ruled out at still higher confidence level. Similarly, M c 
values below 10 5 M Q are also ruled out. 

The MF derived here differs from that obtained by Dowell 
et al. (2008) who find power-law slopes of a ~ -1.8 for 
10 s < M/Mq < 10 7 for clusters in a sample of nearby spiral 
and irregular galaxies. The authors of this study point out that 
their cluster sample may be contaminated by blending and more 
diffuse associations because of the relatively poor resolution of 
their Sloan Digital Sky Survey data, which might lead to an over- 
estimate of the number of very massive objects. They also note 
that completeness effects may have been underestimated at the 
low-mass end, which would lead to a too shallow overall slope. 
Interestingly, however, they find similar MFs in spirals and ir- 
regulars. 

4. Implications for the luminosity function 

Luminosity functions require no conversion from observed lu- 
minosities to masses via age-dependent mass-to-light ratios, so 
it is not surprising that more information is available about LFs 
than MFs. Larsen (2002) fitted power-laws dN/dL cc LP"- to the 
LFs of young clusters in 6 spiral galaxies with HST/WFPC2 
imaging, and found slopes in the range -2 > ai > -2.4 for 
typical Mv intervals -6 > Mv > -9. A trend for the slopes 
to become steeper in brighter magnitude intervals was noted. 
Similar numbers were found for a sample of 5 spirals by Mora 
et al. (2008), using HST/ACS data. The most detailed analysis 
of the LF in a single galaxy is for NGC 5194, which has ACS 
imaging covering the entire disc. From these data, slopes be- 
tween (%l = -2.5 and ai = -2.6 have been obtained for clus- 
ters brighter than Mv = -8 (Haas et al. 2008; Hwang & Lee 
2008). This is similar to the LF slope derived by Dolphin & 
Kennicutt (2002) for clusters in NGC 3627: a L = -2.53 + 0.15 
for -8 > M v > -11. Gieles et al. (2006b) have shown that the 
LF of clusters in NGC 5194 is consistent with clusters having 
been drawn from a Schechter mass function with M c w 1O 5 M . 

The top panel in Fig. 7 shows the luminosity functions corre- 
sponding to various cluster disruption scenarios for a Schechter 
ICMF with M c = 2.1 x 10 5 M Q and a = -2, plotted as number 
of clusters per magnitude bin. LFs are shown for no disruption, 
for secular dissolution with f 4 = 200 Myr and f 4 = 1 Gyr, and 
for mass-independent disruption with IMR=90% for 10 8 years 
(i.e. the same as the Case C scenario in Sect. 2). These LFs were 
obtained by integrating Eq. (8) over all t, assuming a constant 
cluster formation rate. It is evident that the LFs all steepen to- 
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Fig. 7. Top: luminosity functions for a cluster sample with a Schechter 
mass function with M c = 2.1 x 10 5 M o . Bottom: local logarithmic slope. 
In all cases, a constant cluster formation history has been assumed and 
various disruption scenarios are applied, as indicated in the legend. 



wards the bright end and hence cannot be well approximated by a 
single power-law. Local values of the logarithmic slope, i.e., the 
equivalent of the exponent of a local power-law fit dN/dL cc L aL , 
are plotted in the bottom panel of the figure. In the absence of 
secular, mass-dependent evolution the faint-end slope of the LF 
approaches that of the ICMF. At the bright end, the LF steepens, 
while at the faint end secular dissolution of low-mass clusters 
can lead to a shallower slope. 

According to Fig. 7, LF slopes steeper than = —2 are 
generally expected for the magnitude ranges typically covered in 
studies of extragalactic star cluster populations. Comparing with 
the NGC 5 194 observations specifically, the no-disruption model 
predicts a mean slope of (ai) = -2.53 for -8 > Mv > -10, 
in excellent agreement with the studies cited above. Inclusion 
of disruption effects leads to a somewhat shallower slope, e.g. 
(ai) = -2.20 over the same Mv range for f 4 = 2 x 10 8 years. 
However, age-dependent extinction can steepen the LF slope 
(Larsen 2002) and there are indeed indications that younger clus- 
ters in M51 are more heavily extincted (Bastian et al. 2005). 



10 



S0ren S. Larsen: The mass function of star clusters 



Furthermore, the ACS mosaic for which the LF data were de- 
rived cover the whole disc where a longer disruption time scale 
than 200 Myr may be applicable, and the assumption of a con- 
stant cluster formation rate may well be invalid for M5 1 . Lastly, 
the disruption time estimates for M51 were based on the assump- 
tion of an a — -2 power-law ICMF The smaller number of high- 
mass clusters predicted for an M c = 2 x 1O 5 M Schechter func- 
tion would most strongly affect the detectable number of clusters 
at old ages, thus requiring less destruction and possibly lead to 
a longer estimate of the disruption time scale. Interestingly, the 
faint end of the Milky Way open cluster LF (-2 > Mv > -9) 
has a relatively shallow slope of a = —1.5 (van den Bergh & 
Lafontaine 1984). Due to secular dissolution, this is not neces- 
sarily inconsistent with an ICMF with low-mass slope a = -2. 
In summary, a range of LF slopes is clearly expected when fit- 
ting cluster samples in different galaxies over different magni- 
tude intervals. Generally, we conclude that an M c — 2.1 x 10 M 
Schechter MF can lead to LFs that are consistent with observa- 
tions of real cluster samples, but once again note that disruption 
and extinction effects make it difficult to draw strong conclusions 
about MFs based on observations of LFs alone. 



5. The cluster mass function in other environments 
- universal or not? 

While the analysis in the previous sections indicates than an 
M c » 2 x 10 5 M G Schechter function fits the MFs of clusters in 
both cluster-poor and cluster-rich spirals (the latter also in differ- 
ent radial bins), the data are hardly sufficient to claim that this fit 
is universally valid. In the following, we examine to what extent 
this MF is consistent with observations of star clusters in other 
environments. 



5.1. Open clusters in the Milky Way 

When discussing the Milky Way open clusters, the poorly quan- 
tified completeness of current catalogues is a severe obstacle and 
the following comparison therefore remains necessarily sketchy. 
We first estimate, for a given cluster formation rate and MF, how 
many clusters should have formed in the Galactic disc in var- 
ious mass intervals. We assume a star formation rate in bound 
clusters of 5.2 x 10~ 10 M Q yr~' pc~ 2 in the Solar neighbourhood 
(LAM05) and further approximate the Galactic disc as having 
a uniform cluster formation rate within a radius of R = 10 kpc. 
The total cluster formation rate is then ~ 0. 1 6 M Q yr~ 1 , which we 
assume to have been constant in the past. If this mass formed in 
clusters with masses sampled from a Schechter function with a 
lower mass limit of 100 M and M c = 2x 1O 5 M , the Milky Way 
has then formed about 19 000 and 700 clusters more massive 
than 10 4 M G and 10 5 M o over a 10 Gyr period, respectively. This 
corresponds to one (bound) cluster more massive than 10 5 M Q 
forming about every 13 Myr somewhere in the disc, or one more 
massive than 10 4 M Q about every ~ 500000 years. 

About a dozen star clusters with ages ^10 Myr and masses 
in the range 10 4 M o - 1O 5 M have now been found within sev- 
eral kpc from the Sun (Figer 2008). Based on the above, only 
5 clusters with M > 1O 4 M and younger than 10 Myr are ex- 
pected within 5 kpc, but the calculation can hardly be consid- 
ered more than an order-of-magnitude estimate and it is also 
possible that some of the young Milky Way clusters identified 
in current surveys will not remain bound. Hence, the observed 
numbers of young massive clusters does not seem to be in sig- 
nificant disagreement with what is expected based on the esti- 
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Fig. 8. Mass function for star clusters in the LMC younger than 1 Gyr, 
based on photometry from Bica et al. (1996). The dotted-dashed line is 
the same Schechter function as in Fig. 6, arbitrarily scaled. 



mated formation rate of bound clusters and the assumption of an 
M c = 2 x 10 5 M G Schechter MF. 

Comparison with masses 10 5 - 10 6 M o is more difficult. No 
young disc clusters in this range have been identified, but only 
~ 1 is expected to have formed in the last 10 7 years. About 
half of the total ~ 700 clusters in this mass range would al- 
ready have disrupted or have lost a large fraction of their ini- 
tial mass, assuming that the LAM05 disruption parameters were 
valid also in the past. If the remaining objects are evenly scat- 
tered across the Galactic disc, their average surface density is 
about 1 kpc -2 , so a few should be found within ~1 kpc. This is 
in fair agreement with the estimate by LAM05 (based on the cat- 
alogue of Kharchenko et al. 2005) that the most massive cluster 
formed within 600 pc over the past few Gyr had an initial mass 
of~10 5 M G . 

A 1-Gyr old cluster with an initial mass of 10 5 M Q should 
still have a present-day mass of ~60000M G . The surface density 
of clusters younger than 1 Gyr and initial masses above 10 5 M o 
should be about 0.25 kpc~ 2 so a more complete survey of clusters 
in the Galactic disc out to several kpc might reveal a significant 
population of intermediate-age, massive clusters. It is of interest 
to note that the Milky Way does host a number of several Gyr- 
old open clusters which must initially have been quite massive 
(Janes & Phelps 1994; Friel 1995). Identification of these objects 
is helped by the fact that many of them tend to be located quite 
far above the Galactic disc. 

Extrapolating further to above 1O 6 M , the Milky Way disc 
would only form one such cluster about every 40 Gyr for an 
M c = 2 x 1O 5 M Schechter MF, hence it is not surprising that 
none has been found. This latter number is in stark contrast to the 
~ 100 clusters predicted by extrapolation of an a — -2 power- 
law. 



5.2. The Large Magellanic Cloud 

The LMC has long been known to host a rich population of 
young massive star clusters (e.g. Hodge 1961), and studies of the 
LMC clusters benefit from the advantage of having a clear view 
of the entire galaxy. For the brighter clusters, current databases 
are unlikely to suffer from significant incompleteness. Excluding 
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the old GCs, the most massive clusters in the LMC have masses 
of * 2 x 1O 5 M (Hunter et al. 2003; Anders & de Grijs 2006), 
and the MF can be well fitted by a power-law with slope a = -2. 
Since the LMC cluster MF barely overlaps with the spiral data 
in Fig. 6, a direct comparison is not possible, but one can ask 
whether the LMC data are consistent with the Schechter MF de- 
rived for the spirals. To this end, the UBV photometry tabulated 
by Bica et al. (1996) was used to derive ages and masses for 
504 LMC clusters in the same way as was done for the spirals 
in Sect. 3 (except that Z = 0.008 SSP models were used, as ap- 
propriate for the LMC). A distance modulus of m - Al = 18.5 
and foreground extinction Ay - 0.25 mag were assumed. The 
resulting MF for clusters younger than 1 Gyr is shown in Fig. 8 
together with an M c = 2.1 x 10 5 M Q Schechter function. The 
Bica et al. (1996) data have an estimated completeness limit of 
V ~ 13, which corresponds to a limiting mass of ~ 10 4 M o for 
an age of 1 Gyr. The more recent datasets go deeper, but for 
comparison with the spiral data the Bica et al. photometry suf- 
fices. This analysis yields 90 clusters more massive than 10 4 M G , 
of which two exceed 10 5 M o . A K-S test comparing the LMC 
cluster MF for M > 10 4 M o with the Schechter function returns 
P = 0.94, indicating excellent agreement (although the data are 
also consistent with an untruncated power-law; P = 0.56). It can 
therefore be concluded that the LMC cluster MF can be fitted by 
the same MF derived for spirals, although there is little actual 
overlap in mass between the two samples. Thus, the impression 
that the LMC is particularly rich in "massive" clusters is most 
likely an observational selection effect. 



5.3. Starbursts and mergers 



Turning to more extreme star forming environments, there are 
indications that the MF derived for spirals may not be ade- 
quate. We have already noted from the comparison of spiral and 
Antennae cluster MFs in Fig. 6 that there are relatively more 
high-mass clusters in the Antennae sample. Indeed, a Schechter 
function fit to the Zhang & Fall data yields logM c = 5.9+°^ 
(Jordan et al. 2007), while an M c that high is ruled out at high 
confidence level for the spirals. The conditions in the Antennae 
seem to be such that M c has shifted to a higher value, so that a 
relatively larger number of high-mass clusters form. The same 
appears to be the case in M82, although analysis of the cluster 
MF there is complicated by the large amount of extinction in- 
ternally in the galaxy (Smith et al. 2007). From dynamical mass 
measurements for 15 clusters with masses between 2 x 10 5 M G 
and 4 x 10 6 M o , McCrady & Graham (2007) fit a power-law with 
a slope of a = -1.91 ± 0.06. This is again inconsistent with an 
M c - 2x 10 5 M o Schechter MF although selection effects remain 
difficult to quantify. 

The mere presence of > 10 7 M G clusters in NGC 1316, 
NGC 7252 and Arp 220 (Bastian et al. 2006; Wilson et al. 2006) 
is also inconsistent with a Schechter MF with an M c in the range 
10 5 M o - 10 6 M o . If a peak star formation rate of ~ 1000 M yr" 1 
(Bastian 2008) were turned completely into clusters, the number 
of clusters with M > 10 7 M o formed would be less than 5 x 10~ 5 
Myr~' for M c = 10 6 M o . Even if such a high SFR could be sus- 
tained for hundreds of Myrs, a negligible number of 10 7 M Q clus- 
ters would form. Hence, it seems that the cluster MF in these 
more extreme environments must be different than that observed 
in spiral discs. 



5.4. Old globular clusters 

Ultimately, it would be desirable to understand the properties 
of old GC populations in the context of their younger counter- 
parts in present-day star forming galaxies. However, classical 
GCs have been subject to dynamical evolution over a Hubble 
time, which complicates the comparison. Few clusters with ini- 
tial masses less than ~ 10 5 M o would have survived for a Hubble 
time (Vesperini & Heggie 1997; Baumgardt & Makino 2003; 
Fall & Zhang 2001) and the shape of the low-mass end of the 
globular cluster MF is therefore strongly affected by dynamical 
evolution. At higher masses, the MF is more likely to resemble 
its original shape and there is substantial evidence of a steepen- 
ing of the MF at the high-mass end in the rich GC systems of 
giant elliptical galaxies (McLaughlin & Pudritz 1996; Burkert 
& Smith 2000; Jordan et al. 2007). The "evolved Schechter 
function" fits to mass functions for GC systems in Virgo clus- 
ter galaxies by Jordan et al. (2007) indicate an M c increasing 

steadily from (3-4)xl0 5 M o in the faintest galaxies (Mv 16) 

to (2 - 3) x 10 6 M o in the giant ellipticals. Given such evidence 
of variations in M c even among old GC systems, it is not sur- 
prising that the present-day ICMF may also be environmentally 
dependent. Ultimately, understanding these variations may be a 
powerful tool for constraining the formation conditions for GC 
systems at cosmological epochs. 

6. Discussion 

The suggestion that the ICMF has a characteristic upper mass 
scale naturally raises the question as to what physical mech- 
anism is responsible. It is tempting to assume that the ICMF 
is related to the mass function of the giant molecular clouds 
(GMCs) in which cluster formation takes place. The MF of 
GMCs in the Milky Way and other Local Group galaxies can 
be fitted by a power-law with slope « -1.7, slightly shallower 
than for the cluster MF, below a fairly well defined upper limit 
of M s 6 X 10 6 M o (Williams & McKee 1997; Blitz et al. 2007; 
Rosolowsky 2007). Much of the mass in a (Galactic) GMC is 
at low density and global star formation efficiencies (SFEs) are 
typically only a few percent (Myers et al. 1986; Lada & Lada 
2003). Although it is not necessarily true that all star formation 
in a GMC results in a single cluster, the upper GMC mass limit 
and low SFEs seem roughly compatible with the M c derived in 
this paper for the ICMF in spiral galaxies. However, the local 
SFE within cluster-forming clumps in GMCs has to be high to 
produce a bound star cluster (30%-50%; Hills 1980; Elmegreen 
1983; Baumgardt & Kroupa 2007) and ultimately the mass spec- 
trum of these clumps may be more relevant. 

The formation of molecular clouds and the origin of their 
mass function remain poorly understood. The low-mass slope 
may follow from fractal structure in a turbulence-dominated in- 
terstellar medium (ISM) (Elmegreen & Falgarone 1996). It has 
often been suggested that the upper mass limit is related to 
the maximum unstable length scale in a rotating disc, due to 
shear and/or the coriolis force (Toomre 1964; Hunter et al. 1998; 
Dowell et al. 2008). Analytic arguments and detailed numeri- 
cal simulations lead to maximum cloud masses of ~ 10 7 M o , in 
rough agreement with observations (McKee & Ostriker 2007). 
The maximum cloud mass may be expected to scale with 
gas/total mass fraction and total gas mass in discs, so that more 
massive clouds should be able to form e.g. in gas-rich proto- 
galactic discs (Escala & Larson 2008). However, the picture is 
complicated by the highly dynamic character of the interstel- 
lar medium. GMCs are probably rather transient structures (Mac 
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Low & Klessen 2004; Elmegreen 2007) so it is unclear to what 
extent an overall stability criterion is applicable in general. 

What might be responsible for shifting the maximum clus- 
ter mass upwards in starburst environments? If the most mas- 
sive clusters form at similarly low global efficiencies, massive 
"Super-GMCs" (SGMCs) would be required (Harris & Pudritz 
1994; McLaughlin & Pudritz 1996). In the context of old GC 
systems, these SGMCs may be identified with Searle & Zinn 
(1978)-style fragments and cosmological simulations suggest 
that the first GCs may indeed have formed in such clouds, em- 
bedded within dark matter halos (Kravtsov & Gnedin 2005). 
Some evidence that elements of this scenario may be relevant 
in present-day star forming environments is provided by obser- 
vations of cloud complexes with masses up to nearly 10 9 M o in 
the Antennae (Wilson et al. 2003). However, the Antennae is too 
far away to study these structures in much detail with current ra- 
dio interferometers. Giant molecular associations with masses of 
10 7 - 1O 8 M , potentially suitable for forming 10 5 - 10 6 M o clus- 
ters at low overall efficiency, have been found in nearby cluster- 
rich spirals like NGC 5194 and NGC 5236 but tend to break up 
into smaller GMC-like structures when observed at high reso- 
lution (Vogel et al. 1988; Kuno et al. 1995; Rand et al. 1999; 
Lundgren et al. 2004). In general, the ISM in galaxies is hier- 
archically structured also on scales above the scale of individ- 
ual GMCs (Elmegreen & Falgarone 1996; Elmegreen 2007) and 
SGMC-like structures may appear conspicuous in the Antennae 
simply because a larger fraction of the gas is in molecular form 
there (Gao et al. 2001). In M82, the nearest starburst, obser- 
vations at high spatial resolution have failed to reveal distinct 
SGMCs (Keto et al. 2005). 

Alternatively, a shift towards higher maximum cluster mass 
could result if the global SFE within more modest-sized GMCs 
is increased. In mergers, GMCs may be compressed to high den- 
sities by an increased ambient ISM pressure as the two galaxies 
collide, leading to a strong increase in the overall star formation 
rate and formation of massive, bound clusters (Jog & Solomon 
1992; Ashman & Zepf 2001). Indeed, observations show that 
a large fraction of the interstellar medium in starburst galaxies 
(many of which are also mergers) is in a dense phase (Gao & 
Solomon 2004). There is some evidence of shock-triggered clus- 
ter formation in the Antennae, but large-scale shocks are prob- 
ably not directly responsible for most cluster formation there 
(Zhang et al. 2001). GMCs in M82 also seem to be shock- 
compressed, in this case not due to a major merger but due to 
the pressure from surrounding ionized gas (Keto et al. 2005), a 
mechanism which may also be at work in the Antennae. 

The distinction between formation of massive clusters in 
dense clumps within SGMCs and in normal-sized but com- 
pressed GMCs may be somewhat artificial in most present-day 
environments, since any dense concentration of molecular gas 
will always be surrounded by lower-density material. A more 
relevant issue may be what delivers the pressure necessary to 
confine a large amount of gas in dense clumps while star forma- 
tion is taking place (Elmegreen & Efremov 1997). Here, differ- 
ences in local conditions of the interstellar medium, such as gas 
kinematics, may be of relevance. The most actively star-forming 
spiral galaxies in our sample, like NGC 6946, have overall star 
formation rates (SFRs) which are no more than a factor of a few 
smaller than in the Antennae (Larsen 2002; Sauty et al. 1998; 
Stanford et al. 1990; Zhang et al. 2001). In spite of this high 
SFR, the kinematic situation in NGC 6946 (as in most spirals) is 
relatively quiescent. The gas resides in a large, orderly rotating 
disc (Boomsma et al. 2008) with no major distortions apart from 
a number of (probably extra-planar) high velocity clouds. The 



galaxy is among the most isolated in the nearby Universe (Pisano 
& Wilcots 2000), so its high SFR is unlikely to be interaction- 
induced. The gas kinematics in the Antennae are instead com- 
plex with a velocity spread of several hundred km s _1 , notably 
in the overlap region of the two galaxies where star formation 
is currently most active (Gao et al. 2001; Gordon et al. 2001; 
Gilbert & Graham 2007). The starburst in M82 is driving a su- 
perwind with outflow velocities in excess of 100 km s _1 , car- 
rying a significant fraction of the molecular gas in the system 
(Walter et al. 2002), again showing that a large amount of energy 
is being injected into the interstellar gas. Similar complex gas 
kinematics are observed in other starbursts (Arribas et al. 2001; 
Martin 2006). In these environments, formation of massive clus- 
ters may be favoured in regions of high gas pressure (Elmegreen 
& Efremov 1997; Keto et al. 2005). Theoretically, this is sup- 
ported by simulations of major mergers (Mihos & Hernquist 
1996; Li et al. 2004; Bournaud et al. 2008) which show that a 
larger fraction of the gas in the mergers is found in regions of 
high pressure and high velocity dispersion (> 30 km s _1 ) com- 
pared to a spiral disc. 

It is difficult to tell whether star formation in spirals and 
mergers/starbursts represent truly different modes or are part of 
a continuum. While a relatively larger fraction of the gas in vi- 
olent starbursts is in high-pressure regions, such conditions may 
also be encountered more locally in spiral discs (Elmegreen et al. 
2000; Efremov et al. 2007). Indeed, the comparison in Fig. 6 
shows that the maximum cluster mass encountered in spirals 
is not very different from that in the Antennae, although the 
high-mass slope of the ICMF appears steeper in the spirals. The 
Antennae is still in an early stage of the merging process and the 
conditions are likely to get more extreme over the next several 
10 8 years (Mihos et al. 1993) with a possible shift of M c to even 
higher values. There may well be a range of possible M c val- 
ues in present-day star forming environments and not just two 
distinct ("quiescent" and "starburst") modes. 

Some of the most massive clusters might form by a differ- 
ent mechanism altogether, such as merging of less massive clus- 
ters (Elmegreen et al. 2000; Fellhauer & Kroupa 2005). This 
would bypass the need for a mechanism to collect > 10 7 M o of 
gas in a single dense clump, but would still require many lower- 
mass clusters to form sufficiently near each other to merge. The 
merger products are predicted to be relatively diffuse so this 
mechanism is unlikely to account for more compact clusters 
such as those found in NGC 1316 and NGC 7252 (Bastian et al. 
2006). However, extremely massive objects, like the 8 x 10 7 M G 
object W3 in NGC 7252 (Maraston et al. 2004), could have 
formed this way. 

7. Summary and conclusions 

This paper provides three main lines of evidence suggesting that 
the mass function of young star clusters in spiral galaxies can be 
approximated by a Schechter function with M c = 2.1 x 10 5 M o 
and low-mass slope a = -2: 

- The observed age-luminosity relations for the brightest and 
5 th brightest cluster in a sample of mostly late-type spirals 
are consistent with those expected for cluster populations 
drawn from a Schechter MF with an M c of a few times 
10 5 M Q . This result is not significantly affected by secular 
cluster disruption. 

- A maximum-likelihood fit of a Schechter function to the 
combined MF for clusters more massive than 10 5 M G in these 
spirals yields M c = (2.1 ± 0.4) x 10 5 M o for fixed low- 
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mass slope a = -2. The cluster MFs for cluster-poor and 
cluster-rich subsamples of the galaxies are statistically indis- 
tinguishable. 

- The luminosity function for a model cluster sample with 
masses drawn from an M c = 2.1 x 10 5 M© Schechter function 
is consistent with existing data for the LFs of clusters in spi- 
ral galaxies, including the well studied M5 1 cluster system. 

An M c = 2.1 x 10 5 M Q (or M c « 1.4 x 10 5 M o for a Kroupa- or 
Chabrier IMF) Schechter function also fits the MF of clusters in 
the Large Magellanic Cloud, and it is argued that it is probably 
consistent with current surveys of young clusters in the Milky 
Way. It is, however, unlikely to be universal. In merger galaxies 
and other starbursts, a Schechter function fit requires a signifi- 
cantly higher M c than derived for the spirals. Since some spirals 
are not very different from mergers like the Antennae in terms 
of overall star formation rate or gas content, differences in the 
upper end of the cluster MFs are more likely related to the dif- 
ferent gas kinematics in these systems. The fact that M c values 
are found to be greater for old GC systems than for young clus- 
ter populations in discs suggests that star (cluster) formation in 
present-day spiral discs generally proceeds in a relatively more 
orderly way than when most old GCs formed. In the future, ob- 
servations with ALMA should provide crucial insight into the 
properties of molecular gas on scales that are directly relevant 
to formation of individual clusters in galaxies well beyond the 
Local Group. 
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